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Abstract. We study a long standing conjecture on the necessary and 
sufficient conditions for the compatibility of multi-state characters: There 
exists a function f(r) such that, for any set C of r-state characters, 
C is compatible if and only if every subset of f(r) characters of C is 
compatible. We show that for every r > 2, there exists an incompatible 
set C of [§J • r§] + 1 r-state characters such that every proper subset of C 
is compatible. Thus, f(r) > [§J • + 1 for every r > 2. This improves 
the previous lower bound of f(r) > r given by Meacham (1983), and 
generalizes the construction showing that /(4) > 5 given by Habib and 
To (2011). We prove our result via a result on quartet compatibility that 
may be of independent interest: For every integer n > 4, there exists an 
incompatible set Q of L 11 ^] ■ f^^i^l + 1 quartets over n labels such that 
every proper subset of Q is compatible. We contrast this with a result on 
the compatibility of triplets: For every n > 3, if R is an incompatible set 
of more than n — 1 triplets over n labels, then some proper subset of R is 
incompatible. We show this upper bound is tight by exhibiting, for every 
n > 3, a set of n — 1 triplets over n taxa such that R is incompatible, 
but every proper subset of R is compatible. 

Keywords: phylogenetics, quartet compatibility, character compatibil- 
ity, perfect phylogeny 



1 Introduction 

The multi-state character compatibility (or perfect phylogeny) problem is a ba- 
sic question in computational phylogenetics 18 . Given a set C of characters, 
we are asked whether there exists a phylogenetic tree that displays every char- 
acter in C; if so, C is said to be compatible, and incompatible otherwise. The 
problem is known to be NP-complete |3l21j , but certain special cases are known 
to be polynomially-solvable |1I6I12I14I15I16I1§] . See [5] for more on the perfect 
phylogeny problem. 

In this paper we study a long standing conjecture on the necessary and 
sufficient conditions for the compatibility of multi-state characters. 
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Conjecture 1 There exists a junction f(r) such that, for any set C of r -state 
characters, C is compatible if and only if every subset of f(r) characters of C is 
compatible. 

If Conjecture [T] is true, it would follow that we can determine if any set C 
of r-state characters is compatible by testing the compatibility of each subset of 
f[r) characters of C, and, in case of incompatibility, output a subset of at most 
f(r) characters of C that is incompatible. 

A classic result on binary character compatibility shows that /(2) = 2; see 
|5I7I12I17I18] . In 1975, Fitch [9TTU] gave an example of a set C of three 3-state 
characters such that C is incompatible, but every pair of characters in C is 
compatible, showing that /(3) > 3. In 1983, Meacham [T7] generalized this 
example to r-state characters for every r > 3, demonstrating a lower bound 
of f(r) > r for all r; see also [IB]. A recent breakthrough by Lam, Gusficld, 
and Sridhar [TBJ showed that /(3) = 3. While the previous results could lead 
one to conjecture that f(r) = r for all r, Habib and To [13] recently disproved 
this possibility by exhibiting a set C of five 4-state characters such that C is 
incompatible, but every proper subset of the characters in C are compatible, 
showing that /(4) > 5. They conjectured that f(r) > r + 1 for every r > 4. 

The main result of this paper is to prove the conjecture stated in |13j by giving 
a quadratic lower bound on f(r). Formally, we show that for every integer r > 2, 
there exists a set C of r-state characters such that all of the following conditions 
hold. 

1. C is incompatible. 

2. Every proper subset of C is compatible. 

3. |ci = Lu-rn+i- 

Therefore, /(r) > [§J ■ Til + 1 for every r > 2. 

Our proof relies on a new result on quartet compatibility which we believe 
is of independent interest. We show that for every integer n > 4, there exists a 
set Q of quartets over a set of n labels such that all of the following conditions 
hold. 

1. Q is incompatible. 

2. Every proper subset of Q is compatible. 

3. |Q|=|2=2j.ra=2-|+l. 

This represents an improvement over the previous lower bound on the maximum 
cardinality of such an incompatible set of quartets of n — 2 given in |21) . 

We contrast our result on quartet compatibility with a result on the com- 
patibility of triplets: For every n > 3, if R is an incompatible set of triplets over 
n labels, and \R\ > n — 1, then some proper subset of R is incompatible. We 
show this upper bound is tight by exhibiting, for every n > 3, a set of n — 1 
triplets over n labels such that R is incompatible, but every proper subset of R 
is compatible. The results given here on the compatibility of triplets appear to 
have been previously known |20j , but are formally proven here. 
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2 Preliminaries 

Given a graph G, we represent the vertices and edges of G by V(G) and E(G) 
respectively. We use the abbreviated notation uv for an edge {u, v} £ E(G). For 
any e £ E{G), G — e represents the graph obtained from G by deleting edge e. 
For any integer i, we use [i] to represent the set {1, 2, • • • , i}. 

2.1 Unrooted Phylogenetic Trees 

An unrooted phylogenetic tree (or just tree) is a tree T whose leaves are in one 
to one correspondence with a label set L(T), and has no vertex of degree two. 
See Fig. QJa) for an example. For a collection T of trees, the label set of T, 
denoted L(T), is the union of the label sets of the trees in T. A tree is binary 
if every internal (non-leaf) vertex has degree three. A quartet is a binary tree 
with exactly four leaves. A quartet with label set {a, b, c, d} is denoted ab\cd if 
the path between the leaves labeled a and b does not intersect with the path 
between the leaves labeled c and d. 

For a tree T, and a label set L C L(T), the restriction of T to L, denoted 
by T\L, is the tree obtained from the minimal subtree of T connecting all the 
leaves with labels in L by suppressing vertices of degree two. See Fig.QJb) for an 
example. A tree T displays another tree T", if T" can be obtained from T\L(T') 
by contracting edges. A tree T displays a collection of trees T if T displays every 
tree in T '. If such a tree T exists, then we say that T is compatible; otherwise, 
we say that T is incompatible. See Fig. [TJa) for an example. Determining if a 
collection of unrooted trees is compatible is NP-complete [2Tj . 

2.2 Multi-State Characters 

There is also a notion of compatibility for sets of partitions of a label set L. A 
character x on i is a partition of L; the parts of x are called states. If x has at 
most r parts, then x is an r-state character. Given a tree T with i = L(T) and 
a state s of x, we denote by T s (x) the minimal subtree of T connecting all leaves 



Fig. 1: (a) shows a tree T witnessing that the quartets q± = ab\ce, qi = cd\bf ', and 
53 = ad|e/ are compatible; T is also a witness that the characters Xqi — ob\ce\d\f, 
Xq 2 — cd\bf\a\e, and Xg 3 = ad|e/|6|c are compatible; (b) shows T|{a, 6, c, d, e}. 





(a) 



(b) 
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with labels having state s for x- We say that x 1S convex on T, or equivalently 
T displays x, if the subtrees an£ i ^j(x) are vertex disjoint for all states i 

and j of x where i j. A collection C of characters is compatible if there exists 
a tree T on which every character in C is convex. If no such tree exists, then we 
say that C is incompatible. See Fig. [Ha) for an example. The perfect phylogeny 
problem (or character compatibility problem) is to determine whether a given set 
of characters is compatible. 

There is a natural correspondence between quartet compatibility and char- 
acter compatibility that we now describe. Let Q be a set of quartets, n = \L(Q)\, 
and r = n — 2. For each q = ab\cd e Q, we define the r-state character corre- 
sponding to q, denoted Xqi as the character where a and b have state for Xq\ c 
and d have state 1 for Xq\ an d, for each I 6 L(Q) \ {a, b, c, <i}, there is a state s 
of Xq such that € is the only label with state s for character Xq ( see Example [l}. 
We define the set of r-state characters corresponding to Q by Cq = {J qG Q{Xq}- 

Example 1. Consider the quartets and characters given in Fig. QJa): Xqi IS the 
character corresponding to q\ , Xqi is the character corresponding to qi , and Xq 3 
is the character corresponding to q^. 

The proof of the following lemma relating quartet compatibility to character 
compatibility is straightforward, and given in the appendix. 

Lemma 1. A set Q of quartets is compatible if and only if Cq is compatible. 
2.3 Quartet Graphs 

We now give a brief overview of quartet graphs which were introduced in , 
and characterize when a collection of quartets is compatible. 

Let Q be a collection of quartets with label set L. The quartet graph Gq on 
Q is the edge-colored graph defined as follows. 

1. There is a vertex t in Gq for each I in L. 

2. For every quartet q = ab\cd in Q, Gq has two edges ab and cd, both labeled 
by q. We call an edge labeled by q a q-colored edge. 




(a)G Q (b){a,b}^g (c) {e, /} -»• h (d) {g, h} -> i 

Fig. 2: The quartet graph Gq for the set of quartets given in Fig. [T]is shown in 
(a). A complete unification sequence for Gq is shown in (a)-(d). In the figures, 
the unification of vertices U to a new vertex u is denoted U — > u. 
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An example of a quartet graph is given in Fig. [UJa). Let G be any edge 
colored graph. Let U C V(G) such that for any color c, at most one c-colored 
edge is incident on the vertices of U . The unification of vertices U in G is the 
graph G' obtained from G as follows: 

1. Add a new vertex « to G'. 

2. For each c-colored edge vw in G where v € U and w £ U, add a c-colored 
edge uw to G'. 

3. For each c-colored edge vw in G where both v G U and w 6 U, delete all 
c-colored edges from G'. 

4. Delete all vertices in U, and edges incident to vertices in U, from G' . 

See Figures [2] and [3] for examples of unification. A unification sequence for G 
is a sequence Gq=G, Gi, G2, ■ ■ ■ , Gfe of graphs where, for any i > 0, Gi is derived 
from Gj_i by a unification operation. Note that every graph in a unification 
sequence that begins with a quartet graph is also a quartet graph. A unification 
sequence is complete if Gk has no edges. See Fig. |5fa)-(d) for an example of a 
complete unification sequence. The following theorem is from [TTj . 

Theorem 1. A collection Q of quartets is compatible if and only if there exists 
a complete unification sequence for the quartet graph Gq . 

For a quartet graph G, we define the quartet set corresponding to G as the 
set of quartets 

Qg = {ab\cd : there exists edges ab and cd of the same color in G}. 
3 Compatibility of Quartets 

For every s, t > 2, we fix a set of labels L Sit = {ai, 02, ... , a s , b\, 62, ... , &*} and 
define the set 

s-l t-i 

Qs.t = {axbilasbf} U (J (J {aia i+ i\bjb j+ i} 
»=i i=i 

of quartets with label set L Sj t. We denote the quartet ai&i|a s &t by qo, and a 
quartet of the form aia.i+i\bjbj+i by g^j. See Fig.[3Ja) for an illustration. 




(a) The quartet graph for (54,3. (b) {02, 04} — >• u 



Fig. 3: The set of quartets Q4,3- 
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Observation 1 For all s,t > 2, \Q S)t \ = (s - l)(f -1) + 1. 

The proof of the next lemma is straightforward and given in the appendix. 

Lemma 2. Q s ,t is compatible if and only if Qt, s is compatible. 

Lemma 3. For all s,t > 2, Q s ,t is incompatible. 

Proof. We use induction on the size of s + t. 

Base case: Let s — 2, t — 2 and thus, s + t = 4. The set Q2,2 contains only 
the two incompatible quartets qo = ai&i|a2&2 and gi,iaia2|&i&2- Thus, C/2,2 is 
incompatible. 

Induction step: Assume that for every s' <= s and t' < = t, where s > 2, 
t > 2, s' > 2, t 1 > 2 and s+t > s'+i' + l, we have that Q s ',t' is incompatible. To 
prove that Q s ^ is incompatible, we will show that there is no complete unification 
sequence for the quartet graph G = Gq s t . For sake of contradiction, assume that 
there exists a complete unification sequence S for G. Let U be the set of vertices 
unified in the first unification operation of S. 

For each i e [s] and j <E [t] , there exists edges of the same color incident on 
at and bj. Thus, if U contains a vertex a^, it cannot contain a vertex bj. Then, 
w.l.o.g., by Lemma [2j we may assume that U does not contain any vertex bj for 
j G [t\. Also, since the quartet qo — aibi\a s b t £ Q s ,t, it cannot be the case that 
U contains both a\ and a s . W.l.o.g., we assume that U does not contain a%. 

Let a x and a y be the two vertices in U where x is the smallest index over 
all of the vertices in U, and y is the largest index over all of the vertices in U. 
Let G" be the graph resulting from the unification of the vertices of U in G, and 
let u be the unique vertex in V(G') \ V(G). Note that all edges between a x _i 
and a x in G become edges between a x _\ and u in G' , and all edges between 
a y and a y+ i in G become edges between u and a y+ i in G' . See Fig [3] for an 
illustration. Let Q — Qg> ■ Since the unification sequence S = G, G' , G%, ■ ■ ■ ,Gk 
is complete, the sequence G', G\, ■ ■ ■ , Gfc is also complete. Hence, the quartet set 
Q is compatible. 

Let L — L(Qqi). Consider the quartet set Q s _^ y _ x ^ t . Since \U\ > 1, and, by 
assumption, a\ ^ U, we have that 1 < x < y < s. Also, by assumption, t > 2. 
Hence, Q s - y + x .t is well defined. Let h be an injective mapping from L s ^ y+X , t to 
L defined by 

if £ is of the form where 1 < i < x 
if i is of the form where i = x 
if £ is of the form ai where x < i < s — y + x 
if I is of the form bi where 1 < i < t 

We will show that for every quartet q = ^2 G Qs-y+x.t, there exists a 
quartet /i(£i)/i(£ 2 )IM4)M^i) e Q- Since y > x, s - y + x + t < s + t. By the 
inductive hypothesis, Q s - y + Xy t is incompatible. It follows that Q contains an 
incompatible subset of quartets, contradicting that Q is compatible. We have 
the following cases. 



h(£) 
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Case 1: q = go. Then t\ = 01, £2 = &i, £3 = fts-y+z, and £4 = &(. So, h{£\) = 
a±, h(£ 2 ) = h, and ft(£ 4 ) = 6 t . If y = s, then ft(£ 3 ) = ft(a s _j /+a; ) = ft(a s ) = u, 
and a\b\\ub t £ Q. If y < s, then ft(£ 3 ) = h(a s - y+x ) = a s - y+x+y - x = a s , and 
aibi\a s b t £ Q. 

Case 2: g = qij for some 1 < £ < s — y + x and 1 < j < t. Then £1 = cij and 
£2 = fli+i, £3 = &j) and £4 = 6j+i. So, ft(£ 3 ) = bj, and ft(£t) = foj+i- Note 
that since i < s — y + x, we have that if i > x, then y < s. Hence, we have 
the following four possibilities to consider. 

Case 2a: 1 < i < x — 1. Since both i < x and i + 1 < x, h(£\) = at, 
h(£ 2 ) = fli+i, and a^a-i+i 1 e Q. 

Case 2b: i = x— 1. Then ft(^i) = a a: _i, ft(^2) = and aa._iu|&jbj.fi € Q. 

Case 2c: z = x and y < s. Then, h(£i) = h{a x ) = u, ft(^2) = M a z+i) = 
flz+i+y-x = ay+i, and uaj,4-i|6j*6j+i £ Q. 

Case 2d: i > a; and y < s. Then, h(£\) = ft(aj) = aj+y-a and ft(£2) = 
ft(aj-i-i) = ai+i+y-x- Since i > x, it follows that both i + y — x > y and 
i + l+ y — x > y. Since i < s — y + x, it follows that both i + y — x < s 
and i + y - x + 1 <= s. Hence, a i+ y- x a i+ i + y- x \bjbj + i £ Q. 

In every case, we have shown that h{£i)h{£2)\h{£^)h{£^) 6 Q. □ 

Lemma 4. For all s,t >2, and every q £ Q s ,t, Qs,t \ {<?} is compatible. 

Proof. Let q £ Q s .t- Either q = go or 1 = 9x,y f° r some 1 < a; < s and 1 < y < i- 
In every case, we present a tree witnessing that Q s ,t \ {<?} is compatible. 

Case 1. Suppose q = q$. Create the tree T as follows: There is a node for 
each label in L s>t , and two additional nodes a and b. There is an edge ab. For 
every a x £ L Sjt there is an edge a x a. For every b x £ L s j, there is an edge 
b. x b. There are no other nodes or edges in T. See Fig. SJa) for an illustration 
of T. Consider any quartet q £ Q s t \ {go}- Then g = 0^0^+1 \bjbj+\ for some 
1 < i < s and 1 < j < t. Then, the minimal subgraph of T connecting leaves 
with labels in {a,, dj+i, bj, 63+1} is the quartet q. 

Case 2. Suppose q = q x . y for some 1 < x < s and 1 < y < t. Create the tree 
T as follows: There is a node for each label in L s t and six additional nodes 
at, be, I, ft, an, and 6^. There are edges ag£, bi£, £h, hah, and hbh- For every 
a>i £ L s j, there is an edge a.iai if i < x, and an edge a^au if i > x. For every 
bj £ L s t there is an edge bjbg if j < x, and an edge bjbh if j > y. There 
are no other nodes or edges in T. See Fig. Sfb). Now consider any quartet 
q £ Qs.t\{q x , y }- Either q = q or q = q t j where i ^ x or j ^= y. If q = q , then 
the minimal subgraph of T connecting leaves with labels in {ai, 61, a s , bt} 
is the subtree of T induced by the nodes in {a±, at, £, bt, b\, a s , ah, ft, bh, bt}. 
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ai 




a 2 » 






% 




bt 


by 





(a) Case 1: T displays Q Stt \ {go}- 



(b) Case 2: T displays Q S:t \ {<li,j} 



Fig. 4: Illustrating the proof of Lemma 0] 



Suppressing all degree two vertices results in a tree that is the same as qo. 
So T displays q. So assume that q = aidi+i\bjbj+i where i ^ x or j ^ y. We 
define the following subset of the nodes in T: 



V = < 



{ai, ai + i,a,£ 7 £, be, bj,bj + i} 
{ai, ai + i,ai,t, b y , be, h, bh, b y +i} 
{ai, ai + i,ae,l, h, bh, bj,bj + i} 
{a x , ae, I, h, ah, a x+ \,bi>, bj,bj + {\ 
{a x , ae, I, h, ah, a x+ i,bh, bj,bj + i} 
{aj,a J+ i,a h , h,£, be, bj,b J+ i} 
{ a j, a j+i, a h, h, by, be, I, bh, b y +i} 
k {a 3 ,a 3+ i,a h , h, b h , b 3 ,b j+ i} 



if i < x and j < y, 
if i < x and j = y, 
if i < x and j > y, 
if i = x and j < y, 
if i = x and j > y, 
if i > x and j < y, 
if i > x and j = y, 
if i > x and j > y. 



Now, the subgraph of T induced by the nodes in V is the minimal subgraph 
of T connecting leaves with labels in q. Suppressing all degree two vertices 
results in a tree that is the same as q. Hence, T displays q. □ 



Theorem 2. For every integer n > A, there exists a set Q of quartets over n 
taxa such that all of the following conditions hold. 

1. Q is incompatible. 

2. Every proper subset of Q is compatible. 



Proof. By letting s = [^J and t = \^~\ , it follows from Observation [T] and 
Lemmas [3] and H that Q M is of size ([f J - 1) • ([f ] - 1) + 1 = Lnr J ' V ± T L ~\ + 1 
and is incompatible, but every proper subset of Q s ,t is compatible. □ 

The following theorem allows us to use our result on quartet compatibility 
to establish a lower bound on f(r). 

Theorem 3. Let Q be a set of incompatible quartets over n labels such that 
every proper subset of Q is compatible, and letr — n — 2. Then, there exists a set 
C of \Q\ r -state characters such that C is incompatible, but every proper subset 
of C is compatible. 
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Proof. We claim that Cq is such a set of incompatible r-state characters. Since 
for two quartets gi,Q2 £ Q, Xqi Xqn it follows that \Cq\ = \Q\. Since Q is 
incompatible, it follows by Lemma [I] that Cq is incompatible. Let C be any 
proper subset of C. Then, there is a proper subset Q' of Q such that C = Cq>. 
Then, since Q' is compatible, it follows by Lemma Q] that C is compatible. □ 

Theorem [2] together with Theorem [3] gives the following theorem. 

Theorem 4. For every integer r > 2, there exists a set C of r-state characters 
such that all of the following hold. 

1. C is incompatible. 

2. Every proper subset of C is compatible. 



Proof. By Theorem [5] and Observation [TJ there exists a set Q of |_§J • Til + 1 
quartets over r + 2 labels that that are incompatible, but every proper subset is 



A rooted phylogenetic tree (or just rooted tree) is a tree whose leaves are in one 
to one correspondence with a label set L(T), has a distinguished vertex called 
the root, and no vertex other than root has degree two. See Fig. [5[a) for an 
example. A rooted tree is binary if the root vertex has degree two, and every 
other internal (non-leaf) vertex has degree three. A triplet is a rooted binary 
tree with exactly three leaves. A triplet with label set {a, b, c} is denoted ab\c if 
the path between the leaves labeled a and b does include the root vertex. For a 
tree T, and a label set L C L(T), let T" be the minimal subtree of T connecting 
all the leaves with labels in L. The restriction of T to L, denoted by T\L, is the 
rooted tree obtained from T" by distinguishing the vertex closest to the root of T 
as the root of T", and suppressing every vertex other than the root having degree 



3. ici = L§j-r§i+i- 




□ 



4 Compatibility of Triplets 





A A A 

a b c d e f 
(a) A tree T witnessing that the triplets ab\c, 



(b) T\{a,b,c,e} 



a fj 



c e 



de\b, e/|c, and ec\b are compatible 



Fig. 5: Rooted Phylogenetic Trees 
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two. A rooted tree T displays another rooted tree T' if T" can be obtained from 
T\L(T') by contracting edges. A rooted tree T displays a collection of rooted 
trees TUT displays every tree in T ■ If such a tree T exists, then we say that 
T is compatible; otherwise, we say that T is incompatible. Given a collection of 
rooted trees T, it can be determined in polynomial time if T is compatible [5] . 

The following theorems follow from the connection between collections of 
unrooted trees with at least one common label across all the trees, and collections 
of rooted trees [2T] . 

Theorem 5. Let Q be a collection of quartets where every quartet in Q shares 
a common label £. Let R be the set of triplets such that there exists a triplet ab\c 
in R if and only if there exists a quartet ab\c£ in Q. Then, Q is compatible if 
and only if R is compatible. 

Let R be a collection of triplets. For a subset S C L(R), we define the graph 
[R, S] as the graph having a vertex for each label in S, and an edge {a, b} if and 
only if ab\c € R for some c £ S. The following theorem is from [4]. 

Theorem 6. A collection R of rooted triplets is compatible if and only if [R, S] 
is not connected for every S C L{R) with \S\ > 3. 

Corollary 2. Let R be a set of rooted triplets such that R is incompatible but 
every proper subset of R is compatible. Then, [R,L(R)] is connected. 

We now contrast our result on quartet compatibility with a result on triplets. 

Theorem 7. For every n > 3, if R is an incompatible set of triplets over n 
labels, and \R\ > n— 1, then some proper subset of R is incompatible. 

Proof. For sake of contradiction, let R be a set of triplets such that R is incom- 
patible, every proper subset of R is compatible, \L(R)\ — n, and \R\ > n—1. The 
graph [R, L(R)] will contain n vertices and at least n edges. Since each triplet in 
R is distinct, there will be a cycle C of length at least three in [R, L(R)]. Since 
R is incompatible but every proper subset of R is compatible, by Corollary [21 
[R, L(R)] is connected. 

Consider any edge e in the cycle C. Let t be the triplet that contributed edge e 
in [R,L(R)]. Let R' = R\t. Since the graph [R,L(R)]-eis connected, [R',L(R')] 
is connected. By theorem [6j R' is incompatible. But R' C R, contradicting that 
every proper subset of R is compatible. □ 

To show the bound is tight, we first prove a more restricted form of Theorem[2j 

Theorem 8. For every n > 4, there exists a set of quartets Q with \L(Q)\ = n, 
and a label £ G L(Q), such that all of the following hold. 

1. Every q € Q contains a leaf labeled by £. 

2. Q is incompatible. 

3. Every proper subset of Q is compatible. 
I \Q\=n-2. 
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Proof. Consider the set of quartets Q2,n-2- From Lemmas [3] and [U Q2.-n.-2 is 
incompatible but every proper subset of Qi,n-2 is compatible. The set Q2.-n.-2 
contains exactly n — 2 quartets. From the construction, there are two labels in 
L which are present in all the quartets in Q2.r1.-2- Set one of them to be I. □ 

The following is a consequence of Theorem [8] and Theorem [5[ 

Corollary 3. For every n > 3, there exists a set R of triplets with L(R) = n 
such that all of the following hold. 

1. R is incompatible. 

2. Every proper subset of R is compatible. 

3. \R\ =n-l. 

The generalization of the Fitch-Meacham examples given in |16) can also be 
expressed in terms of triplets. For any r > 2, let L = {a, 61, &2, • • ■ , K}- Let 



Let Q — {ab\c£ : ab\c £ R r } for some label I ^ L. The set Cq of r-state 
characters corresponding to the quartet set Q is exactly the set of characters 
built for r in [TB]. In the partition intersection graph of Cq, (following the 
terminology in [16] ) labels I and a correspond to the end cliques and the rest of 
the r labels {61, &2, • • • ,K} correspond to the r tower cliques. From Lemma [TJ 
and Theorem O R r is compatible if and only of Q is compatible. 

5 Conclusion 

We have shown that for every r > 2, f(r) > [§J • [§] + 1, by showing that for 
every n > 4, there exists an incompatible set Q of L 2 ^] • T^r^l + 1 quartets over 
a set of n labels such that every proper subset of Q is compatible. Previous results 
show that our lower bound on f(r) is tight for r = 2 and r = 3 [517112116117118] . 
We give the following conjecture. 

Conjecture 2 For every r > 2, f(r) = [§J ■ [§] + 1. 

Note that, due to Theorem [31 a proof of Conjecture [5] would also show that 
the number of incompatible quartets given in the statement of Theorem [2] is 
also as large as possible. Another direction for future work is to show an upper 
bound on the function f(r), which would prove Conjecture [TJ For quartets, we 
have a trivial upper bound of (™J on the cardinality of a set of quartets over 
n labels such that every proper subset is compatible. However, the question for 
multi-state characters remains open. 
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Technical Appendix 

5.1 Proof of Lemma [T] 

Proof. Let T be any tree. It suffices to show that T displays q — ab\cd if and 
only if Xq is convex on T. (=>) Suppose T displays q. It follows that the path 
from the leaf labeled a to the leaf labeled b does not intersect the path from the 
leaf labeled c to the leaf labeled d. Every other state s of Xq has only one label i s 
with state s for Xq] hence the minimal subtree connecting all leaves with labels 
having state s for x is a single leaf. It follows that Xq is convex on T. (<=) Now 
suppose that Xq is convex on T. Then the minimal subtree containing both the 
leaf labeled a and the leaf labeled b is pair wise disjoint from the minimal subtree 
containing both the leaf labeled c and the leaf labeled d. It follows that the path 
from the leaf labeled a to the leaf labeled b does not intersect the path from the 
leaf labeled c to the leaf labeled d. Hence T displays q. □ 

5.2 Proof of Lemma [2] 

Proof. We define the function g : L Sjt — > L tjS by 




bi if £ is of the form a, for some 1 < i < s 
di if i is of the form bi for some 1 < i < t. 



It is straightforward to verify that iil^&s&i € Q s ,t if and only if g{l\)g{l2)\g{ls)g{l4) G 
Qt,s- The theorem follows. □ 



